AN $A_p$-$A_\infty$ INEQUALITY FOR THE HILBERT TRANSFORM

MICHAEL T. LACEY 



Abstract. We prove in particular that for the Hilbert transform $H$, $1 < p < \infty$, and a weight $w \in A_p$, we have the inequality

\[ \|H\|_{L^p(w) \to L^p(w)} \lesssim \|w\|_{A_p}^{1/p} \max\{ \|w\|_{A_\infty}^{1/p'}, \|w^{1-p'}\|_{A_\infty}^{1/p} \}. \]

The case of $p = 2$ is an instance of a recent result of Hytönen-Pérez, and as a corollary we obtain the well-known bound of S. Petermichl of $\|w\|_{A_p}^{\max\{1, (p-1)^{-1}\}}$. This supports a conjectural inequality valid for all Calderón-Zygmund operators $T$, and $p \ne 2$.



1. Introduction: Main Theorem 

We are interested in estimates for the norms of Calderón-Zygmund operators on weighted $L^p$-spaces, a question that has attracted significant interest recently; definitive estimates of this type have been obtained in [H2, HLM+], among others. Our particular motivation here is the papers of Lerner [L1] and Hytönen-Pérez [HP], which focus on a quantification of good estimates of the norm of an operator in terms of the $A_p$ and $A_\infty$ characteristics of a weight.



1.1. Definition. Let $w$ be a weight on $\mathbb{R}^d$ with density also written as $w$. Assume $w > 0$ a.e., and $1 < p < \infty$. We define $\sigma = w^{-p'/p} = w^{1-p'}$, which is defined a.e., and set

\[ \|w\|_{A_p} := \sup_Q \frac{w(Q)}{|Q|} \Big[ \frac{\sigma(Q)}{|Q|} \Big]^{p-1}. \]

For the endpoint $p = \infty$, we set

\[ \|w\|_{A_\infty} := \sup_Q w(Q)^{-1} \int_Q M(w \mathbf{1}_Q)\,dx. \tag{1.2} \]

Note that the definition (1.2) is different from the more familiar definition of an $A_\infty$ norm [H]. It originates in [W], and the article [HP] makes a convincing case for its central role in the subject. It is known that $\|w\|_{A_\infty} \le C_d \|w\|_{A_p}$ for $1 < p < \infty$. The article [HP] proves a very sharp estimate on the $L^2(w)$-norm of Calderón-Zygmund operators in terms of the $A_2$ and $A_\infty$ characteristics of the weight. The main purpose of this article is to extend this result to the case of $p \ne 2$ for a few canonical Calderón-Zygmund operators. This result, stated just below, suggests a clear conjecture, which we return to in the concluding section of this paper.
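The two characteristics can be explored numerically. The following is a minimal sketch and not part of the argument: it assumes the weight is given by its averages on $2^k$ equal cells of $[0,1)$, restricts both suprema to dyadic intervals, and replaces $M$ by the dyadic maximal function localized to $Q$. It therefore computes dyadic analogues of $\|w\|_{A_p}$ and of (1.2), not the quantities themselves; all function names are hypothetical.

```python
import numpy as np

def dyadic_intervals(n_cells):
    """Yield (start, length) for every dyadic block of cells; n_cells must be a power of 2."""
    length = n_cells
    while length >= 1:
        for start in range(0, n_cells, length):
            yield start, length
        length //= 2

def ap_characteristic(w, p):
    """Dyadic analogue of ||w||_{A_p}: sup_Q avg_Q(w) * avg_Q(sigma)^(p-1)."""
    sigma = w ** (1.0 - p / (p - 1.0))  # sigma = w^{1-p'}, with p' = p/(p-1)
    return max(w[s:s + l].mean() * sigma[s:s + l].mean() ** (p - 1.0)
               for s, l in dyadic_intervals(len(w)))

def local_dyadic_maximal(w):
    """Dyadic stand-in for M(w 1_Q) on Q: largest average over dyadic blocks containing each cell."""
    m = np.zeros(len(w))
    for s, l in dyadic_intervals(len(w)):
        m[s:s + l] = np.maximum(m[s:s + l], w[s:s + l].mean())
    return m

def ainfty_characteristic(w):
    """Dyadic analogue of (1.2): sup_Q w(Q)^{-1} * integral over Q of M(w 1_Q)."""
    return max(local_dyadic_maximal(w[s:s + l]).sum() / w[s:s + l].sum()
               for s, l in dyadic_intervals(len(w)))

# Assumed example weight: exact cell averages of x^{-1/2} on 64 cells of [0, 1).
edges = np.linspace(0.0, 1.0, 65)
w = 2.0 * np.diff(np.sqrt(edges)) / np.diff(edges)
print(ap_characteristic(w, p=2.0), ainfty_characteristic(w))
```

Both quantities are at least 1 for every weight, with equality for constant weights, which gives a quick sanity check of the sketch.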

Let us say that $T$ is a classical Calderón-Zygmund operator if it is of the form $Tf(x) = \mathrm{p.v.} \int f(y) K(x-y)\,dy$ where (1) $K(y) = y^{-1}$, namely the Hilbert transform; (2) $K(z) = z^{-2}$, for complex $z$, namely the Beurling operator; (3) powers of the Beurling operator; (4) $K(y) = y|y|^{-d-1}$, in dimension $d \ge 2$, namely the Riesz transform; and lastly (5) $K(y)$ is any odd, one-dimensional $C^2$ Calderón-Zygmund kernel. By the latter, we mean that $|\partial^\epsilon K(y)| \le |y|^{-1-\epsilon}$ for $\epsilon = 0, 1, 2$. These operators are distinguished in that they are known to be in the convex hull of Haar shifts of bounded complexity. (See the next section for a definition.)

We define the maximal truncations of $T$ by

\[ T_\natural f(x) := \sup_{\epsilon < \delta} \Big| \int_{\epsilon < |x-y| < \delta} f(y) K(x-y)\,dy \Big|. \]

The result below is new for $p \ne 2$ with or without truncations; and in the case of $p = 2$, it is new for the maximal truncations.

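1.3. Theorem. Let $T$ be a classical singular integral in the sense just defined, $1 < p < \infty$, and $w \in A_p$. It then holds that

\[ \|T_\natural\|_{L^p(w) \to L^p(w)} \le C_{T,p} \|w\|_{A_p}^{1/p} \max\{ \|w\|_{A_\infty}^{1/p'}, \|w^{1-p'}\|_{A_\infty}^{1/p} \}. \]

Since we have $\|w\|_{A_\infty} \lesssim \|w\|_{A_p}$, the result above contains the sharp estimate $\|T_\natural\|_{L^p(w) \to L^p(w)} \lesssim \|w\|_{A_p}^{\max\{1, 1/(p-1)\}}$. Indeed, the first term in the maximum contributes $\|w\|_{A_p}^{1/p + 1/p'} = \|w\|_{A_p}$, while for the second $\|w^{1-p'}\|_{A_\infty} \lesssim \|w^{1-p'}\|_{A_{p'}} = \|w\|_{A_p}^{1/(p-1)}$, which contributes $\|w\|_{A_p}^{1/p + 1/(p(p-1))} = \|w\|_{A_p}^{1/(p-1)}$. This last estimate holds in complete generality, and is a central estimate of [HLM+]. There is a weak-$L^p$ norm analog (5.3) of our main inequality that is known in complete generality [HLM+, Section 12].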

A key step in the proof is the analogous result for dyadic models of the operators in question, see 
§2. This is a step that has been critical in many contributions, beginning with the breakthrough 
result of [P2], and has taken an even more central role with the random Haar shifts in [H2]. An 
important component of this definition is a notion of complexity. 

We apply to the Haar shift operators the remarkable Lerner median inequality [L1], see §4. It is well-known that this inequality yields estimates that are exponential in complexity. This means that the main theorem of this section only applies to operators which are in the convex hull of shifts of bounded complexity. Fortunately, this is known to include the few canonical examples indicated above. For the Hilbert transform see [P1]; for the Beurling operator [DV]; for powers of the Beurling operator [DPV]; for Riesz transforms [PTV]; and for $C^2$ odd one-dimensional kernels, see [V]. It is an open problem to extend the main result of this last paper to higher dimensions.






The Lerner median inequality has been applied in the setting of weighted inequalities, see [CUMP1], but we follow a more refined path here. After application of the Lerner median inequality, we get a class of dyadic positive operators with particular structure. There is a very precise understanding of the norms of these operators, see [LSUT1], and we recall these results in §3, analyzing the particular operators of interest in that section. This analysis completes the proof.

Acknowledgment. The genesis for the ideas herein began from my lectures at the Spring School in Analysis, Paseky, Czech Republic. I thank the organizers for their efforts which led to a successful School.



2. The Haar Shift Result

We begin our dyadic analysis. By a grid we mean a collection $\mathcal{D}$ of cubes in $\mathbb{R}^d$ with $Q \cap Q' \in \{\emptyset, Q, Q'\}$ for all $Q, Q' \in \mathcal{D}$. The cubes can be taken to be a product of half-open half-closed intervals. We will say that $\mathcal{D}$ is a dyadic grid if for each cube $Q \in \mathcal{D}$ these two properties hold: (1) $Q$ is the union of $2^d$ subcubes of equal volume (the children of $Q$), and (2) the set of cubes $\{Q' \in \mathcal{D} : |Q'| = |Q|\}$ partitions $\mathbb{R}^d$. We will work with dyadic grids below.

We give a (standard) definition of Haar shifts. By a Haar function we will mean a function $h_I$, supported on $I$, constant on its children, and orthogonal to $\chi_I$ (and no assumption on normalizations). By a generalized Haar function we mean a function $h_I$ which is a linear combination of $\chi_I$ and $\{\chi_{I'} : I' \in \mathrm{Child}(I)\}$. Such a function is supported on $I$ but need not be orthogonal to constants. In the definition, and throughout the paper, $\ell(Q) = |Q|^{1/d}$ is the side length of $Q$.

2.1. Definition. For integers $(m, n) \in \mathbb{N}^2$, we say that a linear operator $S$ is a (generalized) Haar shift operator of parameters $(m, n)$ if

\[ Sf = \sum_{Q \in \mathcal{D}} \; \sum_{Q', R' \subset Q}^{(m,n)} \frac{\langle f, h^{Q'}_{R'} \rangle}{|Q|} h^{R'}_{Q'}, \]

where (1) in the second sum, the superscript $(m,n)$ on the sum means that in addition we require $\ell(Q') = 2^{-m} \ell(Q)$ and $\ell(R') = 2^{-n} \ell(Q)$, and (2) the function $h^{Q'}_{R'}$ is a (generalized) Haar function on $R'$, and $h^{R'}_{Q'}$ is one on $Q'$, with the joint normalization that

\[ \|h^{Q'}_{R'}\|_\infty \|h^{R'}_{Q'}\|_\infty \le 1. \tag{2.2} \]

In particular, this means that we have the representation

\[ Sf(x) = \sum_{Q \in \mathcal{D}} |Q|^{-1} \int_Q f(y) s_Q(x,y)\,dy, \]

where $s_Q(x,y)$ is supported on $Q \times Q$, with $L^\infty$ norm at most one. We say that the complexity of $S$ is $\kappa = \max(m, n)$.

Particular examples include Haar multipliers, and the Haar operators central to the result of [P1], in which the Hilbert transform is obtained as a convex combination of Haar shifts of complexity 1. Notice that a Haar shift using only Haar functions is necessarily bounded on $L^2$, with norm independent of the complexity type. Dyadic paraproducts are the single example in which one should use generalized Haar functions. The complexity type in this case could be taken to be of type $(0,0)$, and the operator is explicitly

\[ Sf = \sum_{Q} a_Q \, \mathbb{E}_Q f \cdot h_Q, \]

where $\mathbb{E}_Q f := |Q|^{-1} \int_Q f\,dx$ is the average of $f$ on $Q$, $|a_Q| \le \sqrt{|Q|}$, and $h_Q$ is a Haar function. In this case, we further assume that $S$ is an $L^2$-bounded operator. We comment that the analysis of Haar shifts has been central to the papers [P1, LPR, HPTV, CUMP1], among several other recent publications. Their central role in this paper is expected.

Define the maximal truncations of the Haar shift operator by

\[ S_\natural f(x) := \sup_{\epsilon > 0} \Big| \sum_{Q : \ell(Q) \ge \epsilon} |Q|^{-1} \int_Q f(y) s_Q(x,y)\,dy \Big|. \]



Our analysis extends to the two-weight setting, and we shift to it now. Given a pair of weights $w, \sigma$, we set the two weight $A_p$ characteristic to be

\[ [[w,\sigma]]_{A_p} := \sup_Q \Big[ \frac{w(Q)}{|Q|} \Big]^{1/p} \Big[ \frac{\sigma(Q)}{|Q|} \Big]^{1/p'}. \]

To connect this to the classical $A_p$ setting, we would take $\sigma = w^{1-p'}$, where we would then have $[[w,\sigma]]_{A_p} = \|w\|_{A_p}^{1/p}$. Also, in the norm inequalities in the remainder of the paper, we will have a certain asymmetry between $w$ and $\sigma$, one designed so that the inequalities behave well with respect to duality. To dualize, interchange the roles of $w$ and $\sigma$, and exchange $p$ for its dual index. This very useful fact appears without comment below.

As we are discussing maximal truncations, the maximal function itself will appear, given by

\[ Mf(x) = \sup_{t > 0} (2t)^{-d} \int_{[-t,t]^d} |f(x+y)|\,dy. \]

The $A_p$-$A_\infty$ estimates for $M$ have been obtained in [HP]; these considerations are less complicated than those for singular integrals. (One only needs Sawyer's two weight characterization [S1] for the maximal function.)

2.3. Theorem. For the maximal function $M$, index $1 < p < \infty$, and a pair of weights $w, \sigma$ we have

\[ \|M(f\sigma)\|_{L^p(w)} \le C_p [[w,\sigma]]_{A_p} \|\sigma\|_{A_\infty}^{1/p} \|f\|_{L^p(\sigma)}. \]

Observe that the lower bound of $[[w,\sigma]]_{A_p}$ for the weak-type norm is entirely elementary; thus the estimate above clearly shows the contribution of the $A_\infty$ norm to the strong-type norm.

This estimate is equal to or smaller than those we are seeking to prove. Our main technical theorem is then stated in the two weight language. The corresponding result for the classical operators is of course true.

2.4. Theorem. For $S$ an $L^2(\mathbb{R}^d)$ bounded Haar shift operator of complexity $\kappa$, index $1 < p < \infty$, and a pair of weights $w, \sigma$, it holds that

\[ \|S_\natural(\sigma f)\|_{L^p(w)} \le C_{S,p} [[w,\sigma]]_{A_p} \{ \|w\|_{A_\infty}^{1/p'} + \|\sigma\|_{A_\infty}^{1/p} \} \|f\|_{L^p(\sigma)}. \]

Our estimate on the operator norm will be exponential in the complexity parameter $\kappa$, so that we will not attempt to quantify it. It is an open question if one can improve this estimate to polynomial in complexity, a question we return to in the concluding section of this paper. In particular, a positive answer would immediately imply that our main Theorem holds for all continuous Calderón-Zygmund operators.

2.1. Deducing the Main Theorem. If $T$ is a classical singular integral operator, we have the following representation for it. There is a measure space $(\Omega, \mathcal{A})$, and a (not necessarily positive) measure $\mu$ on $(\Omega, \mathcal{A})$, with $0 < |\mu|(\Omega) < \infty$, so that for almost all $\omega \in \Omega$, we have a Haar shift operator $S^\omega$ of complexity at most one, so that for $f, g$ smooth compactly supported functions with $\mathrm{dist}(\mathrm{supp}(f), \mathrm{supp}(g)) > 0$,

\[ \langle Tf, g \rangle = \langle K * f, g \rangle = \int_\Omega \langle S^\omega f, g \rangle \,d\mu. \tag{2.5} \]

We remark that if one is interested in the Hilbert transform [P1] or smooth one-dimensional odd Calderón-Zygmund kernels [V], then the measure $\mu$ can be taken positive. And indeed, for the Hilbert transform there is a strong form of the equality above [H1]. For the Beurling operator, however, $\mu$ will be complex valued [DV]. (And much of the difficulty in that paper is showing that the measure $\mu$ is non-trivial!) Thus, our main result, without truncations, follows immediately from the Haar shift version.
Concerning truncations, setting

\[ S^\omega f = \sum_Q |Q|^{-1} \int_Q s^\omega_Q(x,y) f(y)\,dy, \]

let $S^\omega_\epsilon$ be the corresponding operator with sum restricted to those $Q$ with $\ell(Q) \ge \epsilon$. Apply (2.5) with $f, g$ so that $\mathrm{dist}(\mathrm{supp}(f), \mathrm{supp}(g)) \ge \epsilon$. We see that those $Q$ with $\ell(Q) < c\epsilon$ make no contribution in (2.5). From this, it follows that we will have

\[ \langle T_\epsilon f, g \rangle = \int_\Omega \langle S^\omega_\epsilon f, g \rangle \,d\mu, \]

where $T_\epsilon$ is an operator with kernel $K_\epsilon(x,y)$ so that for $|x - y| \ge C\epsilon$ we have $K_\epsilon(x,y)$ equal to $K(x - y)$, the un-truncated kernel. Moreover, by the normalizing condition (2.2), it follows that

\[ |K_\epsilon(x,y)| \lesssim \epsilon^{-d}, \qquad |x - y| < \epsilon. \]

That is, for $|x - y| < C\epsilon$, we have a kernel which at worst performs an average of $f$. From this, we see that we have

\[ T_\natural f \lesssim Mf + \int_\Omega S^\omega_\natural f \,d|\mu|. \]

In view of the fact that the maximal function obeys better bounds, see Theorem 2.3, our main theorem for the maximal truncation of classical singular integrals follows from that for Haar shifts.

3. Dyadic Positive Operators 

We recall elements of the main results from [LSUT1], which codifies and extends the arguments of [S2, S3]. The latter papers of Sawyer characterized the strong-type two-weight inequalities for the fractional integrals, setting out important elements of the two-weighted theory.

Let $\tau = \{\tau_Q : Q \in \mathcal{D}\}$ be non-negative constants, and define linear operators by

\[ T_\tau f := \sum_{Q \in \mathcal{D}} \tau_Q \cdot \mathbb{E}_Q f \cdot \mathbf{1}_Q. \]

Here, we are defining the operator $T_\tau$; its 'localization' corresponding to a cube $R$ restricts the sum to those $Q \subset R$.

Below, we consider the $L^p(\sigma)$ to $L^p(w)$ mapping properties of $T_\tau$, where $1 < p < \infty$. First, we have the weak-type inequalities.

3.1. Theorem. Let $\tau$ be non-negative constants, and $w, \sigma$ weights. Let $1 < p < \infty$. We have the equivalence below.

\[ \|T_\tau(\sigma\,\cdot)\|_{L^p(\sigma) \to L^{p,\infty}(w)} \simeq \mathcal{T}_{p'}(w,\sigma) := \sup_R w(R)^{-1/p'} \Big\| \sum_{Q \subset R} \tau_Q \cdot \mathbb{E}_Q w \cdot \mathbf{1}_Q \Big\|_{L^{p'}(\sigma)}. \]

There is a corresponding, harder, strong-type characterization.

3.2. Theorem. Under the same assumptions as Theorem 3.1 we have the equivalences of norms below.

\[ \|T_\tau(\sigma\,\cdot)\|_{L^p(\sigma) \to L^p(w)} \simeq \mathcal{T}_{p'}(w,\sigma) + \mathcal{T}_p(\sigma,w). \tag{3.3} \]

Notice that the strong type norm is controlled by the larger of two weak-type norms.

3.1. A Particular Class of Operators. We consider a class of operators, motivated by our upcoming application of Lerner's median inequality.

3.4. Definition. We say that a collection $\mathcal{L}$ of dyadic cubes is type L if this condition holds. We have for a constant $\Lambda = \Lambda_{\mathcal{L}} > 0$ so that

\[ \sup_Q \mathbb{E}_Q \exp\Big( \Lambda^{-1} \sum_{Q' \in \mathcal{L} : Q' \subset Q} \mathbf{1}_{Q'} \Big) \lesssim 1. \tag{3.5} \]

For such a collection $\mathcal{L}$, define $T_{\mathcal{L}} f := \sum_{Q \in \mathcal{L}} \mathbf{1}_Q \cdot \mathbb{E}_Q f$.

A simple sufficient condition for (3.5) is e.g. that $\big| \bigcup \{Q' \in \mathcal{L} : Q' \subsetneq Q\} \big| \le \tfrac12 |Q|$. The advantage of the condition above is that it is the minimal condition needed to complete our proof, and it conveniently quantifies our estimate.
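The simple sufficient condition can be tested mechanically on a finite collection. The sketch below is an illustration only: it assumes dimension one, encodes a dyadic interval $[i 2^{-k}, (i+1) 2^{-k})$ as the pair `(k, i)`, and assumes the factor $\tfrac12$ in the condition; the function names are hypothetical.

```python
def interval_cells(level, index, max_level):
    """Cells of width 2^-max_level that make up the dyadic interval (level, index)."""
    width = 1 << (max_level - level)
    return range(index * width, (index + 1) * width)

def satisfies_half_condition(collection):
    """Check that, for each Q in the collection, the union of the strictly
    smaller members contained in Q covers at most half of Q."""
    max_level = max(level for level, _ in collection)
    for level, index in collection:
        cells_q = set(interval_cells(level, index, max_level))
        covered = set()
        for level2, index2 in collection:
            if level2 > level:  # strictly smaller interval
                cells = interval_cells(level2, index2, max_level)
                # Dyadic intervals are nested or disjoint, so one cell decides containment.
                if cells[0] in cells_q:
                    covered.update(cells)
        if 2 * len(covered) > len(cells_q):
            return False
    return True

chain = [(k, 0) for k in range(5)]   # [0,1), [0,1/2), [0,1/4), ...
full = [(0, 0), (1, 0), (1, 1)]      # both halves of [0,1) are present
print(satisfies_half_condition(chain), satisfies_half_condition(full))
```

The nested chain passes because each interval loses exactly half of its measure to its descendants, while the second collection fails because the two halves cover all of $[0,1)$.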

The rationale for this definition will become clear after the discussion in §4. But note that the collection $\mathcal{L}$ is 'thin' in that the $\mathcal{L}$-children of any cube in the collection must be 'thin.' This notion of thinness depends upon the constant $\Lambda_{\mathcal{L}}$; we will have to take this constant to be exponentially large in the complexity of the Haar shift we consider.

3.6. Proposition. For a collection $\mathcal{L}$ of dyadic cubes that is of type L, $1 < p < \infty$, and a pair of weights $w, \sigma$, we have

\[ \|T_{\mathcal{L}}(\cdot\,\sigma)\|_{L^p(\sigma) \to L^{p,\infty}(w)} \lesssim \Lambda [[w,\sigma]]_{A_p} \|w\|_{A_\infty}^{1/p'}, \tag{3.7} \]

\[ \|T_{\mathcal{L}}(\cdot\,\sigma)\|_{L^p(\sigma) \to L^p(w)} \lesssim \Lambda [[w,\sigma]]_{A_p} \max\{ \|\sigma\|_{A_\infty}^{1/p}, \|w\|_{A_\infty}^{1/p'} \}. \tag{3.8} \]

We take up the proof of this Proposition. Of the two estimates (3.7) and (3.8), it suffices to prove the weak-type result (3.7). Indeed, it is the content of (3.3) that the strong-type norm of $T_{\mathcal{L}}$ is characterized by the maximum of the two weak-type norms, from $L^p(\sigma)$ to weak-$L^p(w)$ and from $L^{p'}(w)$ to weak-$L^{p'}(\sigma)$. In so doing, we should keep track of the role of the measures and index $p$. Thus,

\[ \|T_{\mathcal{L}}(\cdot\,\sigma)\|_{L^p(\sigma) \to L^p(w)} \lesssim \|T_{\mathcal{L}}(\cdot\,\sigma)\|_{L^p(\sigma) \to L^{p,\infty}(w)} + \|T_{\mathcal{L}}(\cdot\,w)\|_{L^{p'}(w) \to L^{p',\infty}(\sigma)} \]
\[ \lesssim \Lambda \{ [[w,\sigma]]_{A_p} \|w\|_{A_\infty}^{1/p'} + [[\sigma,w]]_{A_{p'}} \|\sigma\|_{A_\infty}^{1/p} \}. \]

Then, (3.8) follows as the two-weight $A_p$ terms above are equal. In particular, it suffices to show that for any cube $Q_0$, we have

\[ \Big\| \sum_{Q \in \mathcal{L} : Q \subset Q_0} \mathbb{E}_Q w \cdot \mathbf{1}_Q \Big\|_{L^{p'}(\sigma)} \lesssim \Lambda [[\sigma,w]]_{A_{p'}} \|w\|_{A_\infty}^{1/p'} w(Q_0)^{1/p'}. \tag{3.9} \]

The steps below are the argument pioneered in [LPR], and have been used in [H2, HPTV, HLM+, HP]. The details have not been presented before in the positive case, where they are much simpler. We make the definition of the stopping cubes.

3.10. Definition. Let $\mathcal{D}$ be a grid, $w$ a weight. Given a cube $Q \in \mathcal{D}$, we set the stopping children of $Q$, written $\mathcal{T}(Q)$, to be the maximal dyadic cubes $Q' \subset Q$ for which $w(Q')/|Q'| > 4 w(Q)/|Q|$. A basic property of this collection is that

\[ \sum_{Q' \in \mathcal{T}(Q)} |Q'| \le \tfrac14 |Q|. \tag{3.11} \]

We set the stopping cubes of $Q_0$ to be the collection $\mathcal{S} = \bigcup_{j \ge 0} \mathcal{S}_j(Q_0)$, where we inductively define $\mathcal{S}_0(Q_0) := \{Q_0\}$, and $\mathcal{S}_{j+1}(Q_0) = \bigcup_{Q \in \mathcal{S}_j(Q_0)} \mathcal{T}(Q)$. Thus, these are the maximal dyadic cubes, so that passing from parent to child in $\mathcal{S}$, the average value of $w$ is increasing by at least a factor of 4.
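The selection of stopping children is a top-down search, which can be sketched as follows. This is an illustration under stated assumptions, not the construction used in the proof: the dimension is one, the weight is given by its averages on $2^k$ equal cells, cubes are encoded as index blocks `(start, length)`, and `stopping_children` is a hypothetical name.

```python
import numpy as np

def stopping_children(w, start, length):
    """Maximal dyadic blocks Q' inside Q = [start, start + length) whose
    average of w exceeds 4 times the average of w over Q."""
    threshold = 4.0 * w[start:start + length].mean()
    found = []

    def search(s, l):
        half = l // 2
        if half == 0:          # single cell: nothing below it
            return
        for child in (s, s + half):
            if w[child:child + half].mean() > threshold:
                found.append((child, half))  # maximal, so do not descend further
            else:
                search(child, half)

    search(start, length)
    return found

# Assumed example: a flat weight with one large spike in the first cell.
w_spike = np.ones(16)
w_spike[0] = 100.0
children = stopping_children(w_spike, 0, 16)
print(children)
# Packing property: total length of the stopping children is at most |Q| / 4.
print(4 * sum(l for _, l in children) <= 16)
```

Because the search descends only through blocks that fail the threshold, the first block along each branch that exceeds it is automatically maximal, and the selected blocks are pairwise disjoint. Disjointness together with the averaging condition is what yields the packing property stated above.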

We are free to assume that $Q \subset Q_0$ for all $Q \in \mathcal{L}$, where $Q_0$ is as in (3.9). Let us fix a non-negative integer $a$ with $2^a \le 2 [[w,\sigma]]_{A_p}^p$, and integer $b \ge 0$, and take $\mathcal{L}_{a,b}(S)$ to be those $Q \in \mathcal{L}$ so that (a) $S \in \mathcal{S}(Q_0)$ is the smallest stopping cube which contains $Q$; (b) $2^{a-1} < \frac{w(Q)}{|Q|} \big[ \frac{\sigma(Q)}{|Q|} \big]^{p-1} \le 2^a$; and (c) $2^{-b+1} \mathbb{E}_S w < \mathbb{E}_Q w \le 2^{-b+2} \mathbb{E}_S w$. Notice that $a$ holds the $A_p$ ratio fixed, and by definition of the stopping cubes, the sets $\mathcal{L}_{a,b}(S)$ exhaust $\mathcal{L}$ as the three quantities $S \in \mathcal{S}(Q_0)$ and integers $a, b$ vary.

Setting $\mathcal{L}_{a,b} = \bigcup_{S \in \mathcal{S}(Q_0)} \mathcal{L}_{a,b}(S)$, it holds that

\[ \int_{Q_0} \Big[ \sum_{Q \in \mathcal{L}_{a,b} : Q \subset Q_0} \mathbb{E}_Q w \cdot \mathbf{1}_Q \Big]^{p'} \sigma(dx) \lesssim \Lambda^{p'} 2^{-bp'} \|w\|_{A_\infty} 2^{a[p'-1]} w(Q_0). \tag{3.12} \]

This is summed over $0 \le a \le \lceil \log_2 [[w,\sigma]]_{A_p}^p \rceil + 1$ and non-negative $b$ to prove (3.9). The essence of the proof of our claim (3.12) is this distributional estimate.

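3.13. Lemma. With the notations above we have these two distributional inequalities, universal over (1) integers $a, b, t$; (2) $S \in \mathcal{S}(Q_0)$; and (3) measure $\nu$ equal to Lebesgue measure or $\sigma$:

\[ \nu\Big( \sum_{Q \in \mathcal{L}_{a,b}(S)} \mathbf{1}_Q > K \Lambda t \Big) \lesssim e^{-t} \nu(S). \tag{3.14} \]

Here, $K$ is a constant.

Proof. Fix data of the Lemma, and note that it suffices to prove that for a maximal $Q_0 \in \mathcal{L}_{a,b}(S)$, the estimate above holds on the cube $Q_0$. Then, (3.14), for $\nu$ being Lebesgue measure, obviously reduces to

\[ \Big| \Big\{ \sum_{Q \in \mathcal{L}_{a,b}(S)} \mathbf{1}_Q > K \Lambda t \Big\} \Big| \lesssim e^{-t} |Q_0|, \]

which is an immediate consequence of $\mathcal{L}$ being of type L, see (3.5). So, we consider the case of $\nu$ being $\sigma$. But note that with $t \ge 1$ fixed, the event

\[ \Big\{ \sum_{Q \in \mathcal{L}_{a,b}(S)} \mathbf{1}_Q > K \Lambda t \Big\} \]

is a union of disjoint cubes in a collection $\mathcal{L}_t \subset \mathcal{L}_{a,b}$. Now, for each $Q \in \mathcal{L}_t$, we have held the $A_p$ characteristic essentially constant, namely about $2^a$. And $\mathbb{E}_Q w$ is held to be approximately $2^{-b} \mathbb{E}_S w$. Thus, $\sigma(Q)$ is, up to a fixed multiple, a constant times $|Q|$. Call this multiple $\rho$, which is a function of the stopping cube $S$, but the latter is held fixed. Hence,

\[ \sigma\Big( \sum_{Q \in \mathcal{L}_{a,b}(S)} \mathbf{1}_Q > K \Lambda t \Big) = \sum_{Q \in \mathcal{L}_t} \sigma(Q) \simeq \rho \sum_{Q \in \mathcal{L}_t} |Q| \lesssim \rho e^{-t} |Q_0| \simeq e^{-t} \sigma(Q_0). \]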



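Our proof is complete. □

We can now verify (3.12). We will write

\[ \sum_{Q \in \mathcal{L}_{a,b}} \mathbb{E}_Q w \cdot \mathbf{1}_Q \lesssim \sum_{S \in \mathcal{S}(Q_0)} 2^{-b} \mathbb{E}_S w \cdot U_S, \]

where $U_S := \sum_{Q \in \mathcal{L}_{a,b}(S)} \mathbf{1}_Q$. There is one more variable that is useful for us to hold essentially constant. Define an event $E_{S,0} = \{U_S \le 2\Lambda\}$, and for $n > 0$, set $E_{S,n} = \{2^n \Lambda < U_S \le 2^{n+1} \Lambda\}$. Then, we have $\sigma(E_{S,n}) \lesssim e^{-c 2^n} \sigma(S)$ for a constant $c > 0$. Moreover, we can estimate, using a familiar trick,

\[ \Big[ \sum_{S \in \mathcal{S}(Q_0)} 2^{-b} \mathbb{E}_S w \cdot U_S \Big]^{p'} = \Big[ \sum_{n=0}^\infty \sum_{S \in \mathcal{S}(Q_0)} 2^{-b - n/p' + n/p'} \mathbb{E}_S w \cdot U_S \mathbf{1}_{E_{S,n}} \Big]^{p'} \]
\[ \lesssim \sum_{n=0}^\infty 2^n \Big[ \sum_{S \in \mathcal{S}(Q_0)} 2^{-b} \mathbb{E}_S w \cdot U_S \mathbf{1}_{E_{S,n}} \Big]^{p'} \]
\[ \lesssim \sum_{n=0}^\infty 2^n \sum_{S \in \mathcal{S}(Q_0)} \big[ 2^{-b} \mathbb{E}_S w \cdot U_S \mathbf{1}_{E_{S,n}} \big]^{p'}. \]

The first line follows by an appropriate choice of Hölder's inequality, and the second as the sum, for each point $x \in Q_0$, is a super-geometric series of numbers. This is wasteful in $n$, but by (3.14),

\[ 2^n \int_{E_{S,n}} \big[ 2^{-b} \mathbb{E}_S w \cdot U_S \big]^{p'} \sigma(dx) \lesssim \Lambda^{p'} 2^{-bp' - n} (\mathbb{E}_S w)^{p'} \sigma(S) \lesssim \Lambda^{p'} 2^{-bp' + a(p'-1) - n} w(S). \]

The last line follows by trading out the $A_p$ characteristic, which is approximately $2^a$ on these stopping cubes. We can of course trivially sum this in $n \ge 0$. Now, the $A_\infty$ property is decisive. We employ the elementary property (3.11) to estimate

\[ \sum_{S \in \mathcal{S}(Q_0)} w(S) \lesssim \int_{Q_0} \sum_{S \in \mathcal{S}(Q_0)} \frac{w(S)}{|S|} \mathbf{1}_S \,dx \lesssim \int_{Q_0} M(w \mathbf{1}_{Q_0}) \,dx \le \|w\|_{A_\infty} w(Q_0). \]

Combining estimates, we have proved (3.12).

4. Lerner's Median Inequality; Application to Haar Shift Operators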

We recall definitions for the inequality from [L1], which applies to a measurable function $\phi$ on $\mathbb{R}^d$, and cube $Q$. A median of $\phi$ restricted to $Q$ is a possibly non-unique real number $m_\phi(Q)$ such that

\[ \max\{ |\{x \in Q : \phi(x) > m_\phi(Q)\}|, \; |\{x \in Q : \phi(x) < m_\phi(Q)\}| \} \le \tfrac12 |Q|. \]

For parameter $0 < \lambda < 1$, we define a measure of oscillation of $\phi$ to be

\[ \omega_\lambda(\phi; Q) := \inf_c \big( (\phi - c) \mathbf{1}_Q \big)^*(\lambda |Q|). \]

Here, $\phi^*$ denotes the non-increasing rearrangement, so that if $\phi$ is supported on $Q$, $\phi^*(\lambda |Q|)$ is the $\lambda$-th percentile of $\phi$. The local sharp maximal function of $\phi$ is

\[ M^\sharp_{\lambda; Q} \phi(x) := \sup_{Q' \subset Q} \mathbf{1}_{Q'}(x) \, \omega_\lambda(\phi; Q'). \]
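For a function sampled on equal-measure cells of $Q$, both the median condition and $\omega_\lambda(\phi; Q)$ can be evaluated exactly. The sketch below assumes such a discretization and hypothetical function names. It uses the observation that at most $\lfloor \lambda n \rfloor$ of the $n$ samples may lie farther than $s$ from $c$, so the infimum over $c$ equals half the shortest window that contains the remaining samples.

```python
import numpy as np

def is_median(values, m):
    """Check the defining property: at most half the samples lie strictly above m,
    and at most half lie strictly below m."""
    v = np.asarray(values)
    return 2 * np.sum(v > m) <= len(v) and 2 * np.sum(v < m) <= len(v)

def local_oscillation(values, lam):
    """omega_lam for equal-measure samples: inf over c of the rearrangement
    of |values - c| evaluated at lam * n."""
    v = np.sort(values)
    n = len(v)
    keep = n - int(np.floor(lam * n))  # samples that must stay within distance s of c
    if keep <= 1:
        return 0.0
    # Shortest window containing `keep` consecutive sorted samples; c is its midpoint.
    widths = v[keep - 1:] - v[:n - keep + 1]
    return 0.5 * widths.min()

print(is_median([0, 1, 2, 3], np.median([0, 1, 2, 3])))
print(local_oscillation([0, 1, 2, 3], 0.25))       # 1.0
print(local_oscillation([0] * 7 + [10], 0.25))     # 0.0
```

The last example shows the feature that makes this oscillation useful: a large value carried on a set of measure below $\lambda |Q|$ does not contribute at all, in contrast with a mean oscillation.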

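4.1. Theorem (Lerner). Let $\phi$ be a measurable function on $\mathbb{R}^d$, and $Q_0$ a cube. Then, there is a collection of cubes $\{Q_j^\ell\}$, all dyadic cubes contained in $Q_0$, so that

(1) We have the pointwise inequality

\[ |\phi(x) - m_\phi(Q_0)| \lesssim M^\sharp_{1/4; Q_0} \phi(x) + \sum_{\ell=1}^\infty \sum_j \omega_{2^{-d-2}}(\phi; \hat{Q}_j^\ell) \mathbf{1}_{Q_j^\ell}(x), \]

where $\hat{Q}$ is the parent of dyadic cube $Q$.

(2) The cubes $Q_j^\ell$ are disjoint in $j$, with $\ell$ fixed.

(3) Setting $\Omega_\ell = \bigcup_j Q_j^\ell$, we have $\Omega_{\ell+1} \subset \Omega_\ell$.

(4) $|Q_j^\ell \cap \Omega_{\ell+1}| \le \tfrac12 |Q_j^\ell|$.

The collection $\mathcal{L} = \{Q_j^\ell : j, \ell \ge 1\}$ is a collection of dyadic cubes of type L, with $\Lambda_{\mathcal{L}} \simeq 1$. We will say that $\mathcal{M} \subset \mathcal{L}$ has generations separated by $t$ if it is a subset of

\[ \{Q_j^\ell : j \ge 1, \; \ell \equiv t' \bmod t\}, \qquad 0 \le t' < t. \]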

We turn to the application of this inequality to Haar shift operators $S_\natural$. We will show that we can dominate by a sum of dyadic positive operators of type L. The sum has a number of terms controlled by complexity. Then our technical Theorem 2.4 follows from Proposition 3.6. In order to control the measure of oscillations above, as is now standard in the subject, we appeal to a weak-$L^1$ estimate.

4.2. Lemma. Let $S$ be an $L^2$ bounded Haar shift operator of complexity $\kappa$. We then have

\[ \|S_\natural f\|_{1,\infty} \lesssim \kappa \|f\|_1. \]

Aside from the linear bound in complexity, this is a standard argument; details can be found in [H2, Proposition 5]. We need to make these comments on the estimation of the oscillation terms above for Haar shift operators. Recall that $S$ is a Haar shift operator of complexity $\kappa$. Fixing a cube $Q$, and letting $Q^{(\kappa)}$ be the $\kappa$-fold parent of $Q$, it follows that if $g$ is supported off $Q^{(\kappa)}$, then the function $S_\natural(g)$ is constant on $Q$. (Note that this is certainly not true for continuous Calderón-Zygmund operators, so this proof seems to be limited to the dyadic setting.) Constants do not contribute to the measure of oscillation that we are concerned with; therefore, in estimating $\omega_{2^{-d-2}}(S_\natural(f\sigma), Q)$, we can assume that $f$ is supported on $Q^{(\kappa)}$. Moreover, in seeking to estimate this oscillatory term, we can group all the scales inside $Q$, and appeal to the weak-type estimate. For the $\kappa$ scales above $Q$, we use the size condition (2.2). From this, we see that

\[ \Big| \Big\{ x \in Q : S_\natural(f \mathbf{1}_{Q^{(\kappa)}} \sigma) > K \kappa \, \mathbb{E}_Q |f|\sigma + \sum_{t=1}^{\kappa} \mathbb{E}_{Q^{(t)}} |f|\sigma \Big\} \Big| \le 2^{-d-2} |Q| \]

for a dimensional constant $K$. Hence, we conclude that

\[ \omega_{2^{-d-2}}(S_\natural(f\sigma), Q) \lesssim \kappa \, \mathbb{E}_Q |f|\sigma + \sum_{t=1}^{\kappa} \mathbb{E}_{Q^{(t)}} |f|\sigma. \]

From this, it follows that we have the following estimate on the local sharp maximal function, uniformly in cubes $Q$: $M^\sharp_{2^{-d-2}; Q} S_\natural(f\sigma) \lesssim \kappa M(|f|\sigma)$.

Now, let $f$ be supported on a fixed dyadic cube $Q_1$. We apply Theorem 4.1 to $S_\natural(f\sigma)$, restricted to a cube $Q_0$, much larger than $Q_1$. We can estimate

\[ |S_\natural(f\sigma) - m_{S_\natural(f\sigma)}(Q_0)| \lesssim M^\sharp_{1/4; Q_0}(S_\natural(f\sigma)) + \sum_{\ell=1}^\infty \sum_j \omega_{2^{-d-2}}(S_\natural(f\sigma), \hat{Q}_j^\ell) \mathbf{1}_{Q_j^\ell}. \tag{4.3} \]

We have already seen that the local sharp function is bounded by $\kappa M(f\sigma)$. The structure of the Haar shift operator shows that

\[ \omega_{2^{-d-2}}(S_\natural(f\sigma), \hat{Q}_j^\ell) \lesssim \kappa \, \mathbb{E}_{(Q_j^\ell)^{(1)}} \sigma|f| + \sum_{t=2}^{\kappa+1} \mathbb{E}_{(Q_j^\ell)^{(t)}} \sigma|f|. \]

We have again used Lemma 4.2, for the first term on the right. We will show that

\[ \Big\| \sum_{\ell=1}^\infty \sum_j \mathbb{E}_{(Q_j^\ell)^{(t)}}(\sigma|f|) \cdot \mathbf{1}_{Q_j^\ell} \Big\|_{L^p(w)} \le C_{p,t} [[w,\sigma]]_{A_p} \{ \|w\|_{A_\infty}^{1/p'} + \|\sigma\|_{A_\infty}^{1/p} \} \|f\|_{L^p(\sigma)}. \tag{4.4} \]

This shows that the right hand side of (4.3) is bounded with a norm estimate that depends upon complexity. (It will be exponential.) Assuming that $f$ is compactly supported, and taking $Q_0$ arbitrarily large, we can make $m_{S_\natural(f\sigma)}(Q_0)$ as small as we wish. So by Fatou's lemma, we will have finished the proof.

The maximal function obeys our estimate, see Theorem 2.3, bringing our focus to the remaining terms in (4.3). The main point is this: the term in (4.4) is dominated by an operator of type L, with constant $\Lambda_{\mathcal{L}} \lesssim 2^{td}$. From this, and (3.8), the required estimate (4.4) follows immediately.

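Fix $1 \le t \le \kappa + 1$, and note the following. Fix a cube $R$, and consider $\mathcal{R}$, the collection of those $Q_j^\ell \in \mathcal{L}$ such that $(Q_j^\ell)^{(t)} = R$. The collection $\mathcal{R}$ consists of disjoint cubes. This means that below, we can work with a set of indices $(\ell, j) \in \mathcal{K}$ with the defining property of $\mathcal{K}$ being that for all $Q_j^\ell \in \mathcal{L}$ there is a unique $(\ell', j') \in \mathcal{K}$ with $(Q_j^\ell)^{(t)} = (Q_{j'}^{\ell'})^{(t)}$. We argue that this operator is of type L with constant $\Lambda_{\mathcal{L}} \lesssim 2^{td}$:

\[ f \mapsto \sum_{(\ell,j) \in \mathcal{K}} \mathbb{E}_{(Q_j^\ell)^{(t)}} |f|\sigma \cdot \mathbf{1}_{(Q_j^\ell)^{(t)}}. \]

Indeed, for any cube $R$, we have

\[ \sum_{(\ell,j) \in \mathcal{K} : (Q_j^\ell)^{(t)} \subset R} |(Q_j^\ell)^{(t)}| \le 2^{td} \sum_{Q_j^\ell \subset R} |Q_j^\ell| \lesssim 2^{td} |R| \]

by property (4) of Theorem 4.1. This estimate is uniform in $R$; hence, by a well-known John-Nirenberg argument, it shows that $\Lambda_{\mathcal{L}} \lesssim 2^{td}$, completing our proof.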

4.5. Remark. Our approach gives a proof of the weak-type estimate (5.3) from Lerner's inequality, 
one of the few inequalities missing from the papers [CUMP2, CUMP1]. One should note that the 
weak-type inequality for the dyadic positive operators is quite easy. See [LSUT1,S2]. 

5. Concluding Remarks 
Our motivation for writing this paper is to present positive evidence for this conjecture. 

5.1. Conjecture. For $T$ an $L^2(\mathbb{R}^d)$ bounded Calderón-Zygmund operator, $1 < p < \infty$, and $w \in A_p$, it holds that

\[ \|T_\natural\|_{L^p(w) \to L^p(w)} \le C_{T,p} \|w\|_{A_p}^{1/p} \max\{ \|w\|_{A_\infty}^{1/p'}, \|w^{1-p'}\|_{A_\infty}^{1/p} \}. \]

Currently, this is known for the un-truncated operator and $p = 2$, [HP]. For the definition of a Calderón-Zygmund operator, we refer the reader to op. cit. An attractive part of this conjecture, pointed out to the author by Tuomas Hytönen, is that for canonical examples of $T$, it is a standard part of the subject to have a lower bound on $\|T\|_{L^p(w) \to L^p(w)}$ of $\|w\|_{A_p}^{1/p}$. Thus, the form of the estimate above quantifies the $A_\infty$ contribution to the norm.

The corresponding weak-type result is contained in [HLM+, Theorem 12.3], recalled here as it seems to be the strongest known estimate in the case of $p \ne 2$.

5.2. Theorem. For $T$ an $L^2(\mathbb{R}^d)$ bounded Calderón-Zygmund operator and $1 < p < \infty$,

\[ \|T_\natural f\|_{L^{p,\infty}(w)} \le C_{T,p} \|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'} \|f\|_{L^p(w)}, \tag{5.3} \]

(5.4) ||V|| L p (w) < Ct^IIwII^HwII^' + ||f 

It is a curious remark that the extrapolation estimates in [HP] do not give better than the estimate above, despite extrapolating from the sharp $L^2$ estimate. Likewise, our Main Theorem cannot be proved by the same elementary arguments from [CUMP1].

What is required to give a proof of the conjecture? In the singular integral case, we do not have a principle matching that of Sawyer's observation in the positive operator case that the strong type norm is dominated by the maximum of two weak type norms. Instead, the best results in the two weight case are contained in [LSUT2, LSUT3, HLM+]. The cleanest and simplest argument is in [HLM+, Theorem 4.7], which proves a general two-weight estimate for Haar shift operators that is linear in complexity. It is largely satisfying, except for the presence of the 'non-standard' testing condition (4.8). Indeed, the contribution of the non-standard testing condition to the estimate in
(5.4) is the term ||w||]^ by which we miss the conjecture. Any essential strengthening of this 
Theorem would be interesting, and a potential step towards proving the conjecture above.